The XQueC Project: Compressing and Querying XML
نویسندگان
چکیده
We outline in this paper the main contributions of the XQueC project. XQueC, namely XQuery processor and Compressor, is the first compression tool to seamlessly allow XQuery queries in the compressed domain. It includes a set of data structures, that basically shred the XML document into suitable chunks linked to each other, thus disagreeing with the ’homomorphic’ principle so far adopted in previous XML compressors. According to this principle, the compressed document is homomorphic to the original document. Moreover, in order to avoid the time consumption due to compressing and decompressing intermediate query results, XQueC applies ‘lazy’ decompression by issuing the queries directly in the compressed domain. Terms: XML databases, XML compression
منابع مشابه
Efficient Query Evaluation over Compressed XML Data
XML suffers from the major limitation of high redundancy. Even if compression can be beneficial for XML data, however, once compressed, the data can be seldom browsed and queried in an efficient way. To address this problem, we propose XQueC, an [XQue]ry processor and [C]ompressor, which covers a large set of XQuery queries in the compressed domain. We shred compressed XML into suitable data st...
متن کاملOptimizing XML Compression in XQueC
We present our approach to the problem of optimizing compression choices in the context of the XQueC compressed XML database system. In XQueC, data items are aggregated into containers, which are further grouped to be compressed together. This way, XQueC is able to exploit data commonalities and to perform query evaluation in the compressed domain, with the aim of improving both compression and...
متن کاملXquec: Pushing Queries to Compressed XML Data
Initially proposed as a data interchange format, XML aims also at becoming a format for data storage and management. However, XML documents in their textual form are rather verbose and tend to predate disk space, due to the textual and repetitive nature of the XML tags and of several XML types. One solution to this space occupancy problem consists of compressing XML. The XMill project [7] propo...
متن کاملYAQCX: A Word-based Query-aware Compressor for XML Data
XML has become a de facto standard for data exchanging over the Internet. However, efficiently storing and querying XML data is still an open problem. In this paper we present YAQCX, Yet Another Query-aware Compressor for XML. YAQCX adopts word-based modeling combined with byte-coding to provide a very efficient approach to compressing/decompressing and querying XML data. It also implements a s...
متن کاملMetadata & Information Management Issues in XML-Based Mediation
The advancement in XML-based mediation has made a significant impact on the area of resource discovery. Search engines have now been provided with new ways to improve resource discovery and new tools to customise resulting content. In the early days of XML, this work was undertaken within the context of the European funded project GESTALT (Getting Educational System Talk Across Leading Edge Tec...
متن کامل